Aligning letters and phonemes for speech synthesis
Authors
Abstract
A common requirement in speech technology is to align two different symbolic representations of the same linguistic ‘message’. For instance, we often need to align the letters of words listed in a dictionary with the corresponding phonemes specifying their pronunciation. As dictionaries become ever bigger, manual alignment becomes less and less tenable, yet automatic alignment is a hard problem for a language like English. In this paper, we describe the use of a form of the expectation-maximization (EM) algorithm to achieve automatic alignment of English text and phonemes. The quality of alignment is assessed by the performance of a pronunciation-by-analogy system using the aligned dictionary data. We find excellent performance (the best so far reported in the literature on letter-phoneme conversion), independent of the start point for alignment, indicating that the EM search space is strongly convex.
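To make the idea of EM-style letter-phoneme alignment concrete, here is a minimal Python sketch. It is not the algorithm from the paper, whose details the abstract does not give. It assumes a simplified "hard EM" variant in which each letter is paired with either the next phoneme or a null symbol, so a word may not have more phonemes than letters. The names `NULL`, `best_alignment`, `align_dictionary`, and the `smoothing` parameter are all hypothetical.

```python
import math
from collections import defaultdict

NULL = "-"  # hypothetical marker for a letter that maps to no phoneme


def best_alignment(letters, phonemes, score):
    """Find the highest-scoring alignment by dynamic programming.

    Each letter is paired with the next unused phoneme or with NULL.
    """
    n, m = len(letters), len(phonemes)
    if m > n:
        raise ValueError("this sketch assumes no more phonemes than letters")
    neg_inf = float("-inf")
    # best[i][j]: best score for the first i letters and first j phonemes
    best = [[neg_inf] * (m + 1) for _ in range(n + 1)]
    took_phoneme = [[False] * (m + 1) for _ in range(n + 1)]
    best[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(min(i, m) + 1):
            # Option 1: letter i-1 is silent.
            silent = best[i - 1][j] + score(letters[i - 1], NULL)
            if silent > best[i][j]:
                best[i][j], took_phoneme[i][j] = silent, False
            # Option 2: letter i-1 consumes phoneme j-1.
            if j > 0:
                voiced = best[i - 1][j - 1] + score(letters[i - 1], phonemes[j - 1])
                if voiced > best[i][j]:
                    best[i][j], took_phoneme[i][j] = voiced, True
    # Trace the chosen options back from the final cell.
    pairs, i, j = [], n, m
    while i > 0:
        if took_phoneme[i][j]:
            pairs.append((letters[i - 1], phonemes[j - 1]))
            j -= 1
        else:
            pairs.append((letters[i - 1], NULL))
        i -= 1
    pairs.reverse()
    return pairs


def align_dictionary(entries, iterations=5, smoothing=0.1):
    """Alternate between aligning every entry and re-estimating counts."""
    symbols = {NULL} | {p for _letters, phonemes in entries for p in phonemes}
    # Naive start (an assumption): spread each word's phonemes evenly over its letters.
    counts = defaultdict(float)
    for letters, phonemes in entries:
        n = len(letters)
        for letter in letters:
            for p in phonemes:
                counts[(letter, p)] += 1.0 / n
            counts[(letter, NULL)] += (n - len(phonemes)) / n

    alignments = []
    for _ in range(iterations):
        totals = defaultdict(float)
        for (letter, _symbol), c in counts.items():
            totals[letter] += c

        def score(letter, symbol):
            # Smoothed log-probability of the symbol given the letter.
            return math.log((counts.get((letter, symbol), 0.0) + smoothing)
                            / (totals[letter] + smoothing * len(symbols)))

        # "E-like" step: pick the best alignment of each entry under current counts.
        alignments = [best_alignment(l, p, score) for l, p in entries]
        # "M-like" step: re-count letter-symbol pairs from those alignments.
        new_counts = defaultdict(float)
        for pairs in alignments:
            for pair in pairs:
                new_counts[pair] += 1.0
        counts = new_counts
    return alignments
```

Called on a list of `(spelling, phoneme_list)` pairs, `align_dictionary` returns one list of `(letter, phoneme_or_NULL)` pairs per entry. A fuller treatment would also need to handle words with more phonemes than letters (for example "x" in "six"), which this sketch deliberately rejects.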
Similar resources
Designing and implementing a system for Automatic recognition of Persian letters by Lip-reading using image processing methods
For many years, speech has been the most natural and efficient means of information exchange for human beings. With the advancement of technology and the prevalence of computer usage, the design and production of speech recognition systems have been considered by researchers. Among these, lip-reading techniques have encountered many challenges for speech recognition, and one of the challenges b...
Effects of sound pillow in the treatment of stuttering and cognitive phonemes impairment in children
Introduction: Verbal language is a fundamental component for expressing ideas, social interaction, and understanding educational materials. Effective communication requires verbal language skills. Sound pillows may partly help children with behavior problems. The purpose of this study was to assess the effect of an educational sound pillow in the treatment of stuttering and cognitive phonemes i...
On the Efficacy of a Communicative Framework in Teaching English Phonological Features Absent in Persian to Iranian EFL Learners
Although Persian and English share many common phonemes, there are some phonological features that are present in English but absent in Persian which tend to lead to mispronunciation on the part of Persian learners of English, mostly through negative transfer. The present research assesses the efficacy of a communicative framework in improving Iranian adult EFL learners’ pronunciation of five E...
Persian Cued Speech: The Effect on the Perception of Persian Language Phonemes and Monosyllabic Words with and without Sound in Hearing Impaired Children
Objectives: This paper studies the effect of Persian Cued Speech on the perception of Persian language phonemes and monosyllabic words with and without sound in hearing impaired children. Cued Speech is a sound-based mode of communication for hearing impaired people that comprises a limited series of hand complements and the normal pattern of speech. It has been shown that it effectively ca...
Salient phonetic features of Indian languages in speech technology
The speech signal is the basic material for study and analysis in speech technology as well as phonetics. To form meaningful chunks of language, the speech signal should have dynamically varying spectral characteristics, sometimes varying within a stretch of a few milliseconds. Phonetics groups these temporally varying spectral chunks into abstract classes roughly called allophones. Distribution of the...